ABSTRACT

The  increasing  threat  of  microbial  weapons  of  mass  destruc¬ 
tion  (WMD)  creates  a  critical  need  for  rapid  development 
and  deployment  of  target-specific  microsensor  detection  sys¬ 
tems.  Next-generation  biosensor-dependent  technologies  must 
incorporate  intelligence  into  the  microsensor  platform,  en¬ 
abling  the  execution  of  complex  detection  tasks,  such  as  sys¬ 
tematic  identification  and  classification  of  microbial  agents 
and  genetically  modified  organisms  (GMOs)  in  the  pres¬ 
ence  of  non-lethal  agents.  Using  an  information  and  coding- 
theoretic  framework  we  develop  a  de  novo  method  for  map¬ 
ping  a  generic  bio-detection  and  classification  problem  to 
deoxyribozyme-compliant  algorithms.  We  implement  an  al¬ 
gorithm  on  our  novel  computational  biosensor  system  and  as 
a  proof-of-concept  develop  an  application-specific,  computa¬ 
tional  biosensor  for  concurrent  detection  and  classification  of 
H5N1  regional  strains. 

1.  INTRODUCTION 

Hybridization-based  target  recognition  and  discrimination 
is  central  to  a  wide  variety  of  applications:  high  through¬ 
put  screening,  distinguishing  genetically  modified  organisms 
(GMOs),  molecular  computing,  differentiating  biological  mark¬ 
ers,  fingerprinting  a  specific  sensor  response  for  complex  sys¬ 
tems,  etc.  The  recognition  substrate  can  exist  in  solution  or 
be  immobilized  onto  a  transducer  and  hybridization  events 
can be detected optically, electrochemically, or via a mass-sensitive
device [11]. The bioreceptor, or probe, is critical
to  the  specificity  of  the  biosensor.  Although  several  single- 
stranded  DNA  sensor  technologies,  such  as  DNA  microar¬ 
rays,  are  widely  used,  molecular  beacon  probes  are  highly 
sensitive  and  specific  bioreceptors  [11,  3,  12].  They  can  de¬ 
tect  mutations  in  target  sequences  and  can  be  multiplexed 
[3];  these  properties  make  molecular  beacons  effective  plat¬ 
forms  for  detecting  genetically  modified  targets  in  biodefense 
systems. 

1.1  Deoxyribozyme  Molecular  Beacons 

Molecular  beacons  are  single-stranded  oligonucleotide  probes 
that  form  stem-loop  structures.  The  loop  contains  a  probe 
sequence  that  is  complementary  to  a  target  sequence.  Tradi¬ 
tional  molecular  beacons  contain  a  fluorophore  and  quencher 
on  each  arm  of  the  stem  of  the  beacon.  Fluorescence  is 
achieved  by  separation  of  the  fluorophore  and  quencher  due 
to  a  conformation  change  that  takes  place  following  target 
hybridization to the loop structure [9]. However, traditional
beacons  are  limited  by  a  1:1  (target:  signal)  stoichiometry, 
and  the  sensitivity  of  the  detection  is  linked  to  the  amount 
of  target  present.  Target  amplification  (PCR)  is  required  to 
increase  the  level  of  sensitivity. 

Though  DNA  serves  primarily  as  a  carrier  of  the  genetic  code 
and  no  enzymes  made  of  DNA  have  been  found  in  nature, 
molecular  beacons  comprised  of  single-stranded  DNAs  can 
be  engineered  to  perform  catalytic  reactions  similar  to  those 
of  protein  and  RNA.  Catalytic  DNAs  or  deoxyribozymes 
are  synthesized  in  the  laboratory  via  an  in  vitro  iterative 
selection  process. 

Like  traditional  molecular  beacons,  catalytic  molecular  bea¬ 
cons  also  contain  a  stem-loop  structure  that  undergoes  a 
conformational  change  following  target  hybridization  to  the 
loop  region.  Unlike  traditional  molecular  beacons,  catalytic 
molecular beacons are modular molecules, containing a deoxyribozyme
appended to the stem-loop structure [9]. Catalytic
activity is initiated by a target DNA sequence binding
to  the  loop  region  that  is  distinct  from  the  enzymatic  active 
site.  This  allosterically  activates  the  deoxyribozyme  com¬ 
plex  to  bind  and  cleave  a  labeled  substrate  oligonucleotide 
molecule,  producing  a  detectable  signal.  Once  activated,  the 
catalytic  molecular  beacon  will  continuously  cleave  labeled 
substrate  molecules.  Target  amplification  is  not  required 
since signal amplification is obtained through repeated processing
of excess substrate molecules due to the recognition
and  binding  of  a  single  target.  Single  and  multi-receptor 
site  configurations  enable  the  deoxyribozyme  to  function  as 
YES,  NOT,  or  multi-input  AND  gates  [10]. 

Current  high-throughput  DNA  sensing  systems  rely  heavily 
on  silicon-based  computing  for  interpretation  of  molecular 
recognition  events.  Complex  bioinformatics  algorithms  are 
tasked  with  de-noising  and  processing  of  sensor  output  sig¬ 
nals.  This  approach  is  error  prone  and  not  easily  integrated 
into  emerging  nanoscale  sensor  architectures  for  lab-on-chip 
systems.  Molecular  computation,  although  currently  im¬ 
practical  for  real  time  computing  needs,  offers  the  ability 
for  massively  parallel  computation  and  detection,  and  is 
a  promising  technology  base  for  the  development  of  next 
generation  hybrid  molecular  sensor  systems.  In  vitro  com¬ 
putation  using  deoxyribozyme  gates  enables  the  design  of 
complex  biosensor  systems  and  reduces  reliance  on  classical 
computing  for  interpretation  of  sensor  response. 

In  this  work  we  describe  the  development  of  a  deoxyribozyme- 
based  computational  biosensor.  In  Section  2  of  this  paper  we 
discuss  the  methods  for  developing  a  computational  biosen¬ 
sor  for  concurrent  detection  and  classification  of  targets  us¬ 
ing  deoxyribozyme  gates.  In  Section  3  we  demonstrate  the 
use  of  our  sensor  platform  in  the  de  novo  design  and  de¬ 
velopment  of  biosensor  systems  for  detecting  genetic  modi¬ 
fications  in  avian  influenza  and  identifying  a  sample  strain’s 
region  of  origin. 

2.  METHODS 

We  have  devised  a  multi-step  process  for  rapid  de  novo 
design,  development  and  fabrication  of  target-specific  mi¬ 
crosensor  detection  systems.  Given  a  “target,”  we  begin  with 
a  bioinformatical  analysis  and  identify  an  initial  set  of  can¬ 
didate  probe  sequences.  Next,  using  the  initial  probe  set, 
we  generate  a  problem  specific  detection/classification  algo¬ 
rithm.  The  type  and  complexity  of  the  algorithm  depends 
on  the  problem  our  system  will  evaluate.  Phase  two  pro¬ 
duces  multiple  intermediate  probe  sets  with  corresponding 
deoxyribozyme-compliant  algorithms.  This  information  is 
the  input  to  phase  three,  during  which  we  evaluate  the  feasi¬ 
bility  of  the  candidate  probe  sets  and  associated  algorithms. 
Gate-  and  system-level  performance  is  verified  through  sim¬ 
ulation.  The  optimal  design  then  enters  phase  four,  which 
consists  of  system  fabrication  and  experimental  validation. 
We apply our process to the de novo design and development
of computational biosensors for (1) detection of key
mutations in avian influenza, as reported in [5], and (2) detection
of geographical strains of avian influenza, which is described
in this work.

2.1  Distinguishing  Geographical  Strains  of  H5N1 

Surveillance  plus  antigenic  and  genetic  analysis  are  used  to 
monitor  known  H5N1  variants  and  identify  emerging  sublin¬ 
eages  [8].  Continued  monitoring  in  regional  hotspots,  such 
as  southern  China  and  Southeast  Asia,  can  aid  in  evalu¬ 
ating  the  effectiveness  of  disease  control  protocols  and  in 
identifying  the  emergence  of  potentially  pandemic  strains. 
Large-scale  sequencing  efforts,  such  as  the  St.  Jude  In¬ 
fluenza  Genome  Project  [7],  provide  genetic  information  for 
the design of computational biosensors that can aid in tracking
and characterizing regional strains of H5N1. We discuss
the design and feasibility of such systems.

2.2 Informational analysis and extraction of candidate marker regions.

We  downloaded  hemagglutinin  (HA)  gene  sequences  of  H5N1 
influenza  samples  from  the  NCBI  Influenza  Virus  Resource 
[1].  In  all,  we  used  164  sequences  from  samples  spanning  five 
different  geographic  locales:  Egypt,  Hong  Kong,  Indonesia, 
Thailand,  and  Vietnam.  We  generated  a  multiple  alignment 
and  consensus  sequence  for  each  locale  using  MUSCLE  [2] 
with  the  default  parameters,  then  created  a  multiple  align¬ 
ment  of  the  five  consensus  sequences,  in  order  to  index  the 
nucleotides  across  all  sequences  uniformly.  To  find  regions  of 
low  variation  within  a  locale,  we  used  an  information  theory 
approach  that  yielded  996  candidate  probes.  We  reduced  the 
region  classification  problem  to  an  experimentally  tractable 
form:  develop  a  deoxyribozyme-compliant  algorithm  that 
detects  and  classifies  a  candidate  strain  as  originating  from 
Egypt,  Hong  Kong,  or  Indonesia,  using  a  reduced  probe  set 
of  75  candidate  probes.  The  reduced  probe  set  was  selected 
from the 996-probe candidate set based on classification sensitivity
values. We further pruned the probe set to 52 candidate
probes by filtering out probes that did not produce
viable deoxyribozyme structures or that produced undesirable
secondary structures.
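
The low-variation scan described above can be sketched as follows. This is a minimal illustration, assuming the information measure is Shannon entropy per alignment column, averaged over a sliding window; the function names, the 15-nt window, and the entropy cutoff are illustrative assumptions and are not values reported in this work.

```python
import math
from collections import Counter

def column_entropy(column):
    """Shannon entropy (bits) of one alignment column, ignoring gap symbols."""
    symbols = [c for c in column if c != "-"]
    if not symbols:
        return 0.0
    counts = Counter(symbols)
    total = len(symbols)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

def low_variation_windows(aligned_seqs, window=15, max_mean_entropy=0.1):
    """Return (start, mean_entropy) for windows with low mean column entropy.

    aligned_seqs: equal-length strings from one locale's multiple alignment.
    window and max_mean_entropy are assumed, illustrative settings.
    """
    length = len(aligned_seqs[0])
    entropies = [column_entropy([s[i] for s in aligned_seqs]) for i in range(length)]
    hits = []
    for start in range(length - window + 1):
        mean_h = sum(entropies[start:start + window]) / window
        if mean_h <= max_mean_entropy:
            hits.append((start, mean_h))
    return hits
```

Windows that pass the cutoff within one locale would then be compared across locales to rank candidate probes by how well they separate the regions.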

2.3 Design of realizable deoxyribozyme-compliant classification algorithms.

Using  methods  from  communication  and  coding  theory  [6,  4] 
we  generate  constraints  for  producing  an  efficient  deoxyribozyme- 
compliant  classification  algorithm  using  a  minimal  set  of 
probes.  We  devised  a  reduced  (N=52,K=47)  coding-based 
algorithm  for  subtype  classification  that  uses  nine  probes  to 
detect and classify an H5N1 nucleic acid sample as originating
from Egypt, Hong Kong, or Indonesia. The three-bit
classification algorithm is:

CompWell1 = probe23 ⊕ probe48
CompWell2 = probe4 ⊕ probe23 ⊕ probe24 ⊕ probe34 ⊕ probe43 ⊕ probe50
CompWell3 = probe20 ⊕ probe51

where ⊕ denotes exclusive-OR. CompWell represents the computational
well, which is realized by combining the discretized binary output of the
microplate wells.
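
The three-bit computation can be written out as a short sketch, reading the operator between probes as exclusive-OR (the function described in Section 2.4). The dictionary and function names are illustrative; only the probe-to-well assignments come from the text.

```python
# Probes combined by XOR in each computational well, as listed in the text.
COMP_WELLS = {
    "CompWell1": (23, 48),
    "CompWell2": (4, 23, 24, 34, 43, 50),
    "CompWell3": (20, 51),
}

def comp_well_bits(present_probes):
    """XOR the binary probe readouts feeding each computational well.

    present_probes: set of probe indices scored as binary 1 in the sample.
    Returns (CompWell1, CompWell2, CompWell3).
    """
    bits = []
    for probes in COMP_WELLS.values():
        bit = 0
        for p in probes:
            bit ^= 1 if p in present_probes else 0
        bits.append(bit)
    return tuple(bits)
```

For example, a sample in which only probe 4 is detected gives `comp_well_bits({4}) == (0, 1, 0)`.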

Prior  to  fabrication  and  experimental  testing,  we  can  com¬ 
putationally  “test”  the  gate  using  hybridization  thermody¬ 
namics  to  predict  the  fluorescence  response  of  our  gate  to 
target  sequences  and  possibly  mutated  forms  of  the  target 
[5]. For the YESi34 gate, which would bind the complement
of  probe  34,  we  used  a  set  of  optimized  fit  parameters  to 
predict  the  response  of  the  gate  to  its  complementary  tar¬ 
get  sequence  and  fifteen  mutant  inputs  (Figure  1).  For  the 
region  identification  problem,  we  desire  gates  that  can  accu¬ 
rately  distinguish  the  target  from  a  non-target;  the  gates  do 
not  need  to  exhibit  a  distinct,  mutation-dependent  fluores¬ 
cence  signature.  Based  on  our  initial  predictive  simulation 
of  the  YESi34  gate,  it  seems  to  be  a  viable  gate  and  would 
potentially  pass  on  to  the  fabrication  process.  We  can  per¬ 
form  similar  tests  on  each  generated  gate. 
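
As a rough illustration of how an input panel for such a test might be assembled, the sketch below derives the full-complement target of the YESi34 sequence and one point-substituted variant per position. The text does not state how the fifteen mutant inputs were constructed, so the one-substitution-per-position design and the choice of substitute base are assumptions; the thermodynamic scoring itself is not reproduced here.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[b] for b in reversed(seq))

def single_mismatch_panel(target):
    """One point-substituted variant per position (assumed panel design)."""
    panel = []
    for i, base in enumerate(target):
        # Illustrative choice: replace each base with its complement.
        substitute = COMPLEMENT[base]
        panel.append(target[:i] + substitute + target[i + 1:])
    return panel

gate_probe = "CTCGTCTGTTCCGAT"            # YESi34 sequence given with Figure 1
target = reverse_complement(gate_probe)    # full-complement input
mutants = single_mismatch_panel(target)    # 15 single-mismatch inputs
```

Each sequence in `mutants` could then be passed to a hybridization model to predict its fluorescence response relative to the perfectly matched target.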

We  used  the  Oligonucleotide  Modeling  Platform  (OMP)  from 
DNA  Software  to  compute  parameters  to  predict  the  re¬ 
sponse  of  individual  deoxyribozyme  gates  to  input  probes 
and to simulate the algorithmic performance of the system.
Modeling and simulation enable us to detect undesirable
hybridizations that may occur in the presence of input
probes or when gates are combined to form computational wells.
Upon detection we can return to our algorithm generation
step and produce a more realizable algorithm. This reduces
the probability of fabricating faulty systems, thus reducing
final cost.

Figure 1: Predicted ribozyme fluorescence output for the designed
YESi34 gate (5'-CTCGTCTGTTCCGAT-3') and corresponding full
complement plus 15 SNP-containing input target sequences.
Fluorescence values given in arbitrary units.

2.4  Realization  of  Computational  Biosensor 

We implement a single-gate (YES gate only) system and
a multi-gate (YES gate/AND-NOT gate) system. Figure 2
shows single and double input deoxyribozyme gates. Single
gates use only one probe, while multi-gates are composed of
two or three probes. The AND-NOT gate implementation
uses two AND-NOT gates to implement the exclusive-OR
(⊕) function. Currently we perform the exclusive-OR in
our post-processing step, but the AND-NOT gate demonstrates
our ability to perform the exclusive-OR operation in
a microplate well.

[Figure 2 panels: YESiA (E6) deoxyribozyme gate, with enzymatic
module and stem-loop module (target sequence binding); iAANDiB (8-17)
deoxyribozyme gate, with enzymatic module, input A binding, and
input B binding.]

YESiA (E6) gate:
  Substrate ST: 5'-TAMRA-TCACTATrAGGAAGAG-BH2-3'
  Target sequence: 5'-GCTTATAATCGTATA-3'

iAANDiB (8-17) gate:
  Substrate: 5'-FAM-TAGTAACTrAGAGATCAT-BH1
  Target sequences: 5'-GCTTATAATCGTATA-3'
                    5'-ACATTTGAGCACTTA-3'

Figure 2: YESiA (E6) and iAANDiB (8-17) gates.

Table 1: Probe composition of “sample” test sequences
(HK = Hong Kong, Indo = Indonesia).

Seq      P4  P20  P23  P24  P34  P43  P48  P50  P51
Egypt     1    0    0    0    0    0    0    0    0
HK        0    0    0    0    0    1    1    1    1
Indo-1    0    1    1    1    1    0    0    0    0
Indo-2    0    1    0    0    1    0    0    0    0
Indo-3    0    1    1    1    0    0    0    0    0

2.5  Experimental  Design 

The  following  experimental  tests  were  run  to  assess  the  in¬ 
telligent  biosensing  platform: 


Control 1: Test each of the 9 YESiA gates to ensure operability
and to produce a reference signal.

Control 2: Test each of the 4 AND-NOT gates to ensure
operability and to produce a reference signal.

Control 3: Test each XOR (consisting of two AND-NOT gates
in a single well):

probe1 ⊕ probe2 = (probe1 AND-NOT probe2) + (probe2 AND-NOT probe1)

where + represents the OR function.

•  Test 1: Probe1 absent, Probe2 absent

•  Test 2: Probe1 absent, Probe2 present

•  Test 3: Probe1 present, Probe2 absent

•  Test 4: Probe1 present, Probe2 present

Classify: For the regional subtype classification experiments,
we add a mixture containing all the probes indicated
by a 1 (the probes are subsequences found in the sample
sequences downloaded from NCBI's Influenza Virus Resource
[1]) into each of the microplate wells (subwells)
to simulate the test “sample”. Table 1 shows the
“samples” and the probes (Pi) present in each sample.
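
The expected outcome of the Control 3 tests can be checked against an idealized Boolean model of the two-gate well. This sketch treats each gate as a perfect logic element, which is an assumption; real gates produce graded fluorescence that must first be discretized. The function names are illustrative.

```python
def and_not(a, b):
    """Idealized AND-NOT gate: active when input a is present and b is absent."""
    return a and not b

def xor_well(probe1, probe2):
    """Two AND-NOT gates sharing one well; signal from either gate counts (OR)."""
    return and_not(probe1, probe2) or and_not(probe2, probe1)

# Tests 1-4: every presence/absence combination of the two probes.
truth_table = [
    (p1, p2, xor_well(p1, p2))
    for p1 in (False, True)
    for p2 in (False, True)
]
```

Under this model the well should signal only in Test 2 and Test 3, where exactly one of the two probes is present.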

3.  EXPERIMENTAL  RESULTS 

3.1 Feasibility demonstration using a fluorescence-based laboratory platform.

We devised a fluorescence-based laboratory detection platform
and designed and constructed all required deoxyribozyme
gates to implement a rapid diagnostic system for subtype
classification of avian influenza strain H5N1. Our experimental
setup consists of a 55-60 µl detection volume containing
reaction buffer, deoxyribozyme gates, fluorophore-labeled
substrates, and input sequences. Fluorescence was
measured with a FluoDia T70 Microplate Reader (PTI, Inc.)
using black 384-well microplates (OptiPlate-384F, Perkin-Elmer)
at room temperature.


Table 2: Syndrome classification patterns for subtype classification.

Subtype Class  CompWell1  CompWell2  CompWell3
Egypt              0          1          0
Hong Kong          1          0          1
Indonesia          1          1          1

3.2  Classification  via  computational  post-processing 

In comparison to traditional hybridization-based sensors,
the  output  of  the  computational  biosensor  requires  limited 
post-processing  to  identify  the  subtype  category  to  which  the 
sample  nucleic  acid  sequence  belongs.  We  first  discretize  the 
fluorescence  output  from  each  microplate  well  (subwell)  to 
a  binary  value  by  comparing  the  slope  of  the  linear  regres¬ 
sion  line  of  the  control  to  that  of  the  sample.  If  the  slope 
of  the  sample  is  equal  to  or  greater  than  |  of  the  slope  of 
the  control,  that  subwell  is  scored  as  a  binary  1,  else  it  is 
scored  as  a  binary  0.  The  binary  subwell  values  are  used  to 
compute the value of each of the three CompWells. The pattern
of the computational wells determines the group to which the
sequence belongs. Table 2 shows the expected binary patterns
for each group. Using the binary patterns, we correctly
identified the subtype of 97.3% of the “sample” sequences.
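
The slope-based discretization step can be sketched as below. This is a minimal illustration using an ordinary least-squares fit of fluorescence against time; the function names are illustrative, and the threshold fraction is left as a parameter, so any value passed in a usage example is an assumption rather than the setting used in the experiments.

```python
def regression_slope(times, signal):
    """Ordinary least-squares slope of signal versus time."""
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(signal) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, signal))
    return sxy / sxx

def score_subwell(times, sample, control, fraction):
    """Return binary 1 if the sample slope reaches `fraction` of the control slope."""
    sample_slope = regression_slope(times, sample)
    control_slope = regression_slope(times, control)
    return 1 if sample_slope >= fraction * control_slope else 0
```

The resulting subwell bits are then combined into the three CompWell values and matched against the expected patterns.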

4.  CONCLUSION 

We  have  developed  a  platform  for  the  de  novo  design  and 
realization of a computational biosensor based on deoxyribozyme
logic gates. The biosensor system is able to detect
genetic  modifications  in  the  target  and  perform  complex  de¬ 
tection  tasks,  reducing  reliance  on  classical  computing  for 
output  interpretation.  The  ability  to  compute  differentiates 
this  system  from  other  molecular  beacon  based  systems.  By 
increasing  the  complexity  of  the  algorithm  (or  incorporating 
more  computational  wells)  we  can  potentially  increase  the 
sensitivity  and  specificity  of  the  biosensor,  hence  increasing 
accuracy  by  increasing  computation. 

Our  deoxyribozyme-based  computational  sensor  system  can 
provide  rapid  detection  (in  optical  detection  we  were  able 
to  distinguish  target  and  mutated  target  within  minutes), 
will  not  require  PCR  for  target  amplification,  can  simulta¬ 
neously  detect  and  classify  in  vitro,  and  is  favorable  for  a 
field  deployable  platform.  This  de  novo  platform  provides 
a  unique  capability  that  can  help  transform  the  way  we  de¬ 
tect  biological  agents  and  protect  the  Future  Soldier  against 
microbial  WMDs. 

We  are  investigating  immobilization  methods  for  electro¬ 
chemical  detection  of  deoxyribozyme  gate  activity.  The  elec¬ 
trochemical  detection  platform  will  produce  differential  cur¬ 
rents  to  indicate  positive  or  negative  algorithm  outcomes. 
This  will  obviate  our  need  for  a  fluorescence  reader,  thus 
permitting the development of a field-deployable rapid detection
system.